F0 Analysis and Modeling for Cantonese Text-to-Speech
Abstract
This paper presents a study on the control of fundamental frequency (F0) in Cantonese text-to-speech (TTS) systems. The surface F0 contour of an utterance is treated as the combination of tone-related local components and phrase-level long-term variation. A novel F0 normalization method is developed to separate the two. Statistical analysis is performed on the phrase curves and tone contours extracted from a large speech corpus, and the results are summarized into regular patterns. These patterns serve as the basic templates in a non-parametric F0 model, from which utterance-level F0 contours can be generated. A perceptual test shows that speech naturalness is significantly improved by the new F0 model: the mean opinion score (MOS) increases by 0.65 on a five-point scale.
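The decomposition described in the abstract can be illustrated with a minimal sketch. The abstract does not give the normalization formula or the template values, so everything below is an assumption made for illustration: the phrase curve is a straight-line declination in the log-F0 domain, tone templates are three-point offsets in semitones, and the two are combined multiplicatively. The names `TONE_TEMPLATES`, `phrase_curve`, `generate_f0`, and `normalize_f0` are hypothetical and do not come from the paper.

```python
import math

# Illustrative tone templates: offsets in semitones relative to the phrase
# curve at the start, middle, and end of a syllable. These values are made
# up for the sketch and are NOT taken from the paper's corpus statistics.
TONE_TEMPLATES = {
    "high_level": (4.0, 4.0, 4.0),
    "rising": (-2.0, 0.0, 3.0),
    "low_level": (-3.0, -3.0, -3.0),
}


def phrase_curve(n_frames, start_hz=220.0, end_hz=180.0):
    """Phrase-level component: linear declination in log-F0 (assumed shape)."""
    if n_frames == 1:
        return [start_hz]
    log_start, log_end = math.log(start_hz), math.log(end_hz)
    step = (log_end - log_start) / (n_frames - 1)
    return [math.exp(log_start + step * i) for i in range(n_frames)]


def interpolate_template(template, n_frames):
    """Stretch a three-point template to n_frames by linear interpolation."""
    start, mid, end = template
    values = []
    for i in range(n_frames):
        pos = i / (n_frames - 1) if n_frames > 1 else 0.0
        if pos <= 0.5:
            values.append(start + (mid - start) * pos / 0.5)
        else:
            values.append(mid + (end - mid) * (pos - 0.5) / 0.5)
    return values


def generate_f0(tones, frames_per_syllable=20):
    """Superimpose tone offsets on the phrase curve to get surface F0 in Hz."""
    offsets = []
    for tone in tones:
        offsets.extend(interpolate_template(TONE_TEMPLATES[tone], frames_per_syllable))
    phrase = phrase_curve(len(offsets))
    f0 = [p * 2.0 ** (o / 12.0) for p, o in zip(phrase, offsets)]
    return f0, phrase, offsets


def normalize_f0(f0, phrase):
    """Remove the phrase component, leaving tone offsets in semitones."""
    return [12.0 * math.log2(v / p) for v, p in zip(f0, phrase)]


f0, phrase, offsets = generate_f0(["high_level", "rising", "low_level"])
recovered = normalize_f0(f0, phrase)
```

Under these assumptions, normalization is simply the inverse of generation, so `recovered` matches `offsets` up to floating-point error. With real speech the phrase curve is not known in advance and must be estimated, which is the harder problem the paper's normalization method addresses.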
Related articles
Acoustical F0 Analysis of Continuous Cantonese Speech
This paper presents a preliminary study on acoustical analysis of fundamental frequency (F0) in continuous Cantonese speech. By understanding how the surface F0 contour is determined by many co-functioning and inter-playing linguistic or non-linguistic factors, our ultimate goal is to facilitate automatic F0 prediction for highly natural text-to-speech synthesis. A novel method of F0 normalizat...
Perceptual equivalence of approximated Cantonese tone contours
This paper describes a perceptual study on approximated Cantonese tone contours. We believe that the perception of tone contours relies mainly on the major trend of pitch movement, and is not sensitive to the exact F0 values at particular time instants. The tone contours of individual syllables and the transition between them are approximated as a small number of linear movements. The effect of...
Analysis and Synthesis of Cantonese F0 Contours Based on the Command-response Model
Cantonese is a well-known Chinese dialect with a quite complex tone system. We have applied the command-response model to represent F0 contours of Cantonese speech by defining a set of appropriate tone command patterns. In this paper, the analysis is extended to Cantonese utterances at three different speech rates. By incorporating the effects of tone coarticulation, word accentuation and phras...
Analysis of F0 contours of Cantonese utterances based on the command-response model
As a major Chinese dialect, Cantonese is well known for its complex tone system. This paper applies the command-response model to represent the F0 contours of Cantonese speech. Analysis-by-Synthesis is conducted on both utterances of carrier sentences and utterances with less constrained structures, from which a set of appropriate tone command patterns is derived. By intrinsically incorporating ...
Perception-based automatic approximation of F0 contours in Cantonese speech
In our previous studies, it was found that F0 variations in Cantonese speech can be adequately represented by linear approximations of the observed F0 contours, in the sense that comparable perception with natural speech can be attained. The approximated contours were determined manually. In this study, a framework is developed for automatic approximation of F0 contours. Based on the knowledge ...
Publication date: 2004